.X O U S3 S£. «J» « O i- J ' O W E. 



0 ^ ZC02 



Figure 1 



Vfcxflt 1 to 3640 



CGGGGGGAGTGGGGAGGAGGGGGGTCGGCCGCCGCAGCCATS 



ATCCCAT 

a s a 17 



MEAN'WTAFLFQAHE 

CTTCCTOOCC^^ 

HQQQAAQNSLLPLLSSAVEPPDQKPLLPIP47 

ATTACTCAGAAACCTCAGGCTGCACCAGAAACATTAAAGGATGCCATTGGGATTAAA 

ITQKPQ AAPETLKDAIGIKKEKPKT8 F V C T 77 

TACTGCAGTAAAGCATTCAGGGACAGCTATCACCTGAGGCGCCATCAGTCCTGCCA 

YCSKAFRDSYHLRRHfiflrw TGXKLVSRAKK 107 

ACCCCCACCACGGTGGTTCCCCTTATCTCCAC^TTGCTOGGG 

TPTTVVPLISTIAGDSSRT8LVSTIAGILS137 

B „ m „ ATGCCTGTGCCCCAGTCTGTCAAGAAACCCAOT 
8 8A8TTAMPVPQSVKKPS 167 

AAGCCTGTCAAaAAaAACCACGCCTGTGAGATGTGTOGQAAGGC 

KPVKKHHA CEMCOKAFttPVYH^ MRgKLflB 8 



GACCyuUUU^CCTTTGAGTGTCCTATTTGTAAT^^ 

D E K P F E CP ICNQRFKR ie nRMTY HVR »H E 



197 

G G 227 
ATOKCA^COTATACT^^ 

I T K P T T C8YCgKgrffRP DHLfifi BV KH v H 8 T 257 
GAAAC^OXTTCAAATGCCAAACGTG^ 

E R P F K CQTCTAAFAT lt n RLRTHMVnii E C K V 287 
TCATGTAACATCTGTGGGAAGCTCTTGAGTGCAGCATATATCACGAGCCA 

8 CHICGKLLSAAYTTHHI.1CTH GQSQSZKCM 317 

ACGTGCAAACAAGGCATCAGCAAAACGTGXJVTGAGTGAQGAGACCAGCA 

TCKQGISKTCMSEETSHQKQQQQQQQQQQQ347 

* •***•#*•*## 

CAACAACAACAACATGTGACAAGCTCGCCAGGGAAGCAGGTAGAGACACTGA^ 

QQQQHVTSWPGKQVETLRLWBBAVKARKKE377 
* * * « 

GCTGTCCAACCTGTGCCAAACCTCCACGGCTGCTACGACACCAGTGACTCTCACT 

AANLCQTSTAATTPV TLTTPFKITS SV88G 407 

ACTATGTCAAACCCAGTCACAGTQGCAGCTGCAATGAGCATGAGAAGTCCAGTAAATGTCT^ 

TMSNPVTVAAAMSMRSPVHVSSAVKITSPL 437 

G(^TOACCTCACCTTTAACACTCACCACCaaWTCAA 

A M T S tTTPVMLPTPVTAPVHIAHPVTI 467 

ACATCTCCAATOAACCTGCCCACTCCTATGACATTAGCTGCCCCTCTCAATAT^ 

T S P M H I P'TPMTLAAP LN ZAMRPVE SMP 



F L P 497 

CAAGCTTTGCCTACGTGACCGCCTTGGTAAACAGTATC 

QALPTSPPW 506 

GATTAAAGCAAAAAJSCAGACTATGAAATTGGGAGGTTTTAOT 
AAAGTAGGGTATATGTGTAACTTATCACTGGACCACTTTAGTTTAC^ 
GCTGGCAAGACGAAATGCCAGAATTAAAACCAATCATAAAACCCATTTCAAAATAAAAAA 
ACAACAGAATCATTT TATTGTAAA^CTAG CAGAGTTCTTCCCTCTCTAC^ 
GAGAGCTAGTCTAGCATTGTCTCTGGTTTTG^ 
TACCTTTCGGAATATGl^AATTTCAGGCAGCATTCCCTATGG^ 
TCTAACTTCAACATATAATAGGGATTGTGACCTGATATTTGGAGAT^ 
^AGTTCACTTTTTAAAAAAAAAAAAACAAAAAAAAAAAAAAGGAATTCCGAGAG 
GGTGGCCTCAGAACCCAGGAAGTGGCCAAGGGGCACAQACTCTGCTGGAGGC^ 
TTTGCCTTAGGCCACAAAAAGAAGAAGGCTACCCCACTTACAGATGCAGA 
AGTGTTGGCAGCAGATGGTACTACTGAGCATGTCTACAGACCCA 
GACCa u ri<nX3CAGCGTOTATACCAGCCTTACX7rCACCACTTG^ 
CTATCGCCGTAGCCCTGGGGTGACTCCCGCAAGGCCTCGCTATGCTTGCTGCCC^ 
TGGAGCAGCAATATCCCAGCCTO»TGTGGGAATGGA« 
CTTGa^AGACAGATGT^TGA^^ 

ACAGCAGGAG 



180 

270 

360 

4S0 

540 

630 

720 

810 

900 

990 

1080 

1170 

1260 

1350 

1440 

1530 

1620 



ACAGCCTGGCCTCTCGGTCCACAGAGCATGGGCTACAAGATCCTG 
AC?TCCCACAAGACACCACTCCACCCACAGAGAGCCTAGGGGACCCATGG . 

gggcagatttgcaagtttacac^^ 

tcttagqaggwcctagqacttgggtatggggagtggggtcttg 
tgtctccatctttgtcat; 



1710 
1800 
1890 
1980 
2070 
2160 
2250 
2340 
2430 
2520 
2610 
2700 
2790 
2880 
2970 
3060 
3150 
3240 
3330 
3420 
3510 
3600 
3645 
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Figure 2 



Human DB1 dna and Protein S q nc s: 

10 20 30 40 50 60 

AGCGGGGGGAGTGGGGAGGAGGGGGGTCGGCCGCCGCAGCCATGGAGGCCAACTGGACCG 

M E A N W T> 

70 80 90 100 110 120 

CGTTCCTGTTCCAGGCCCATGAAGCTTCCCATCACCAACAGCAGGCAGCACAGAACAGCT 
AFLFQAHEASHHQQQAAQNS> 

130 140 150 160 170 180 

TGCTGCCCCTCCTGAGCTCTGCCGTGGAGCCCCCTGATCAGAAACCATTGCTTCCAATAC 
LLPLLSSAVEPPDQKPLLPI> 

190 200 210 220 230 240 

CAATAACTCAGAAACCTCAGGGTGCACCAGAAACATTAAAGGATGCCATTGGGATTAAAA 
PITQKPQGAPETLK DAIGIK> 

250 260 270 280 290 300 

AAGAAAAACCCAAAACTTCATTTGTGTGCACTTACTGCAGTAAAGCTTTCAGGGACAGCT 
KEKPKTSFVCTYCSKAFRDS> 

310 320 330 340 350 360 

ATCACCTGAGGCGCCACGAATCCTGCCACACAGGGATCAAGTTGGTGTCCCGGCCAAAGA 
YHLRRHESCHTGIKLVSRP K> 

370 380 390 400 410 420 

AAACCCCCACCACGGTGGTTCCCCTTATCTCTACCATCGCTGGGGACAGCAGCCGAACTT 
KTPTTVVPLISTIAGDSSR T> 

430 440 450 460 470 480 

r CGTT^GGTCTCGACCATTGCAGGCATCTTGTCAACAGTCACTACATCTTCCTCGGGCACCA 
"? :\?*...V* STIAGILSTVTTSSSGT> 

— *4?0 500 510 520 530 540 

ACCCCAGTAGCAGTGCCAGCACCACAGCTATGCCAGTGACCCAGTCTGTCAAGAAACCCA 
NPSSSAST TAMPVTQSVKKP> 

550 560 570 580 590 600 

GTAAGCCTGTCAAGAAGAACCATGCTTGTGAGATGTGTGGGAAGGCCTTCCGAGATGTGT 
SKPVKKNHACEMCGKAFRD V> 

610 620 630 640 650 660 

ACCATCTCAATCGACACAAGCTCTCCCATTCAGATGAGAAACCCTTTGAGTGTCCTATTT 
YHLNRHKLSHSDEKPFECPI> 

670 680 690 700 710 720 

GTAATCAGCGCTTCAAGAGGAAGGACCGGATGACTTACCATGTGAGGTCTCATGAAGGAG 
CNQRFKRKDRMTYHVRSHEG> 

730 740 750 760 770 780 

GCATCACCAAACCCTATACTTGCAGTGTTTGTGGGAAAGGCTTCTCAAGGCCTGACCACT 
GITKPYTCSVCGKGFSRPDH> 
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Figure 2 (con*t) 



790 800 810 820 830 840 

TAAGCTGTCATGTAAAACATGTCCATTCAACAGAAAGACCCTTCAAATGCCAAACGTGCA 
LSCHVKHVHSTERPPKCQTO 

850 860 870 880 890 900 

CTGCTGCCTTTGCCACCAAAGACAGACTGCGGACACACATGGTGCGCCATGAAGGCAAGG 
TAAFATKDRLRTHMVRHEG K> 

910 920 930 940 950 960 

TATCATGTAACATCTGTGGGAAGCTCCTGAGTGCAGCATACATCACCAGCCACTTAAAGA 
VSCNICGKLLSAAYITSHL K> 

970 980 990 1000 1010 1020 

CTCATGGGCAGAGCCAAAGTATCAACTGTAATACATGTAAACAAGGCATCAGTAAAACAT 
THGQSQS INCNTCKQ GISKT> 

1030 1040 1050 1060 1070 1080 

GCATGAGTGAAGAGACCAGTAACCAAAAGCAGCAGCAGCAGCAGCAGCAGCAACAACAAC 
CMSEETSNQKQQQ.QQQQQQQ> 

1090 1100 1110 1120 1130 1140 

AACAACAACATGTGACAAGCTGGCCAGGGAAGCAAGTAGAAACACTCAGACTGTGGGAAG 
QQQHVTSWPGKQVETLRLWE> 

1150 1160 1170 1180 1190 1200 

AAGCTGTTAAAGCAAGGAAGAAAGAAGCTGCTAACCTGTGCCAAACCTCCACGGCTGCTA 
EAVKARKKEAANLCQTSTA A> 

1210 1220 1230 1240 1250 1260 

CGAGACCTGTGACTCTCACTACTCCATTCAGTATAACATCCTCTGTGTCGTCTGAGACTA 
' r \?' { .T+fY- % V TLTTPFSITSSVSSET> 

• 1270 1280 1290 1300 1310 1320 

TGTCAAACCCAGTCACAGTGGCAGCTGCAATGAGCATGAGAAGTCCAGTAAATGTTTCAA 
MSNPVTVAAAMSMRSPVNVS> 

1330 1340 1350 1360 1370 1380 

GTGCAGTTAACATAACCAGCCCAATGAACATAGGGCATCCTGTAACTATAACCAGTCCAT 
SAVNITSPMNIGH PVTITSP> 

1390 1400 1410 1420 1430 1440 

TATCCATGACCTCTCCTTTAACACTCACTACCCCAGTCAACCTCCCCACCCCCGTCACTG 
LSMTSPLTLTTPVNLPTPV T> 

1450 1460 1470 1480 1490 1500 

CCCCAGTGAATATAGCACACCCTGTCACCATCACATCTCCAATGAATCTACCCACACCTA 
APVNIAHPVTITSPMNLPT P> 

1510 1520 1530 1540 1550 1560 

TGACATTAGCCGCCCCTCTCAATATAGCAATGAGACCTGTAGAGAGCATGCCTTTCTTGC 
MTLAAPLNIAMRPVESMPF L> 
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Figure 2 (con't) 



1570 1580 1590 1600 1610 1620 

CCCAAGCTTTGCCTACATCACCGCCTTGGTAAACAGTATTATAAAATCAAAATATGGGTA 
PQALPTSPPW*> 

1630 1640 1650 1660 1670 1680 

AAAGTAAATATTTACCAGCAACTTAACTTTTAGTTGATTAAAGCAAAAAGTAAACCATGA 

1690 1700 1710 1720 1730 1740 

AATTGGGAGATTTTATTACATTAGTTAATAAGAGTGTGGTAGCATTTTTCTCCAATTTGG 

1750 1760 1770 1780 1790 1800 

CTGGGATTATTCAAAGTAGGGTGTGTATGTAACTTATCACTGGACCACTTTAGTTTAATC 

1810 1820 1830 1840 1850 1860 

AGAAATTCCTTTTAGCTGACAACATTGCTTAAACAGGATAGTAGTTGGCAAGATGAAATG 

1870 1880 1890 1900 1910 1920 

CCAGAATTAAAACCAATCATAAGTAGAACCCACTTCAAAATAAAAAAACAGCATTACTAT 

1930 1940 1950 1960 1970 1980 

TTCTAATCCCAAGGAATCACTTTATTGTAAACACTAGCAGAACTCTTCTCCCTATACAAG 

1990 2000 2010 2020 2030 2040 

GTGGATGGCTGATTTTAACCTGAAATTTTAAATCCACAGATTGAGAGCTAGTGTAGAATT 

2050 2060 2070 2080 2090 2100 

GTCTGTGTTTATTGTTTTTATGAGTAAATACATGCATTGTCATAATAAAATGCATTTCAG 

2110 2120 2130 2140 2150 2160 

AGAATATGCAT.TTTACCTTTGGGAATATGTTAATTTCAGGCAGCATTCCCTATGGGAAAG 

' : " ' 2170 2180 2190 2200 2210 2220 

QTGATAC?eAGCTCTGATATGCAAAGCATATGATAATTTATCATTCTAACTTCAACGTATA 

2230 2240 2250 2260 2270 2280 

ATAGGGATTGTGACCTGATATTTGGAGATGTAAATATTGCTCAGCATATTAATCCCGATG 

2290 2300 
GAATATAGCATTGTAGTTGACTTTTT 
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Ffgur 3 



A. Retroviral 
Entrapment 
Vector supF 




EcpRI 



LTR 



B. Infection of ES cells 
Isolation of G41 8 r clones 



C. Screen for AP expression 
In embryofd bodies 

\ a ex ^ ^ ^ / — > 



Undifferentiated 
ES cells 



Embryoid bodies Terminally differentiated 
I embryoid bodies 

T 

D. Selection of ES cell clones with 
differential AP expression in vitro 



^..Aggregation chimeras 
Transfer into pseudo- 
pregnant foster females 



F. AP expression in 
day-8.5 to 9.5 embryos 




G. Cloning of 

flanking cellular DNA 
by supF complementation 
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Figur 4 
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Figur 15 



A: Jackson BSS Chromosome 2 



Abl 

Psmb7 
Acvr2a 



4534 2 2 1 1 1 1 4 3 



R 
£13 

£13 
ai3 



SE 
1.49 
1.49 
1.49 



1.06 1.06 
1.06 1.06 
7.45 Z71 



B: Jackson BSS Chromosome 2 

9 
■ 

•4- D2MH1, D2B\r2 

- - D2Mit4, D2Mit31 

- - Jund2, Mpmv3 

- - D2MM78, D2MH464 

- - 02M/tf 

- - 

- - D2Bir5, Pax8 

- - Lhx3, Notchl, Vezfl 

- - Abl, Bng, Pbx3 
-f- Psm67 

I 5 CM 



- - Acvr2a, D2MH155 
■+- Ctsb-rs, D2MH8 
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Figur 16 

Restriction Enzyme Map of a 20 kb Genomic DNA of the Vezfl Gene 

2kb 

3' 



, r i iv On 



4 




Xh 

AT© 



1 5 kb Ea 3 kb 

BamHI (B), EcoRI (E), EcoRV (E5), Eagl (Ea), Notl (N), Patl (P), Sad (S), 
Xbal (Xb), and Xhol (Xh). 



— Intronic sequence; 
[T]Exon 1 
[2] Exon2 
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Figure 17 

Vezfl EXPRESSION VECTORS 



CMV-Vezfl 
Ti 2-Vezf 1 



CMV/ 
Tte2 



Vezfl cDNA IRES AP SV40 



■..T^ 2 .'. , . ■ ™ m . intron+p(A > 

mill ^nnnmiMM 



CMV-AZF-Vezf1 ^ . 309 992 

Ti 2-AZF-Vezfl 



[^VJ;l;li ! ilii;i;B 
r.v.v.v.'J « ; j ! 1 1 1 : t ' ■ : 'B 



CMV-Vezfl -ZF , , 309 992 

TI 2-Vezf1-ZF I I l^^ii-iiiiiiiii 



CMV-AP 
TI 2-AP 
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